20 research outputs found

    Cyberbullying Detection on Twitter Using Natural Language Processing and Machine Learning Techniques

    People use social media to engage and debate themes ranging from entertainment to sports to politics and many others. The use of social media has also resulted in an increase in cyberbullying, which is occurring at an alarming pace. Many cyberbullying messages may be found in the comment sections of many social media platforms, including Twitter, YouTube, and others. Cyberbullying can cause stress and mental distress, so it should be detected early and prevented from being published on social media platforms. In this study, we provide a system for detecting cyberbullying messages in English using natural language processing (NLP) and machine learning approaches. A total of 16851 tweets were gathered from Twitter. An NLP approach was applied to the dataset to find the most offensive terms associated with cyberbullying. Based on our NLP results, it was clear that cyberbullying happens and must be addressed as soon as possible. The dataset was also utilized to train the random forest (RF) and support vector machine (SVM) algorithms. Random forest attained an accuracy of 98.5%, surpassing the support vector machine, which attained 90.5%. The model's high accuracy is obtained with careful attention to data preparation, where missing and outlier values are dealt with beforehand. This method facilitates the analysis of the available data at the expense of the study's statistical power and ultimately the validity of its findings. Additionally, it aids in producing a significant bias in the outcomes and increases the effectiveness of the data. The root mean square error and mean square error were used to analyse the results. In comparison to the support vector machine, the random forest earned the best error score. Our findings may be utilized by agencies and groups to educate individuals about the proper use of social media in order to avoid cyberbullying.
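The abstract above reports that mean square error and root mean square error were used to compare the two classifiers. As a minimal sketch of how those two scores are computed, assuming labels and predictions are encoded as 0/1 values (the function names and the toy data are illustrative, not taken from the study):

```python
import math


def mse(y_true, y_pred):
    """Mean square error between two equal-length sequences of numbers."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean square error: the square root of the MSE."""
    return math.sqrt(mse(y_true, y_pred))


# Hypothetical toy data: 1 = cyberbullying, 0 = not cyberbullying.
labels = [1, 0, 1, 1, 0]
predicted = [1, 0, 0, 1, 0]

print(mse(labels, predicted))   # 0.2
print(rmse(labels, predicted))
```

With 0/1 labels and 0/1 predictions, the MSE equals the fraction of misclassified items, so a lower error score corresponds directly to a higher accuracy.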

    Trade-off assessments between reading cost and accuracy measures for digital camera monitoring of recreational boating effort

    Digital camera monitoring is increasingly being used to monitor recreational fisheries. The manual interpretation of video imagery can be costly and time consuming. In an a posteriori analysis, we investigated trade-offs between the reading cost and accuracy measures of estimates of boat retrievals obtained at various sampling proportions for low, moderate and high traffic boat ramps in Western Australia. Simple random sampling, systematic sampling and stratified sampling designs with proportional and weighted allocation were evaluated to assess trade-offs in terms of bias, accuracy, precision, coverage rate and cost in estimating the annual total number of powerboat retrievals in 10,000 jackknife resampling draws. The relative standard errors (RSE ± standard deviations) obtained by the sampling designs for sampling proportions from 0.4 onwards were below a 20 % threshold for three of the sampling designs across the three boat ramps. Coverage rates of over 90 % were observed for the confidence intervals for the estimated annual number of powerboat retrievals, with low relative standard errors (RSE < 20 %). Interpreting 40 % of camera footage within a year provided the minimum level to obtain sufficient accuracy measures for all sampling designs considered. The stratified random sampling design with weighted allocation consistently resulted in the smallest variance for estimates of annual powerboat retrievals across the various sampled proportions. These findings have the potential to considerably reduce the cost of manual data interpretation, since operating cost increased linearly with increasing sampling proportion.
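To make the relative standard error criterion above concrete, here is a minimal sketch of how an annual total and its RSE could be estimated from a simple random sample of days. It uses the textbook estimator for simple random sampling without replacement, which is an assumption made for illustration; the study itself evaluated several designs with jackknife resampling. The function name and the sample values are hypothetical.

```python
import math
import statistics


def srs_total_and_rse(sample_counts, population_size):
    """Estimate a population total and its relative standard error (%)
    from a simple random sample drawn without replacement."""
    n = len(sample_counts)
    if n < 2 or n > population_size:
        raise ValueError("need 2 <= sample size <= population size")
    mean = statistics.fmean(sample_counts)
    variance = statistics.variance(sample_counts)  # sample variance (n - 1)
    total = population_size * mean
    if total == 0:
        raise ValueError("RSE is undefined when the estimated total is zero")
    fpc = 1 - n / population_size  # finite population correction
    se_total = population_size * math.sqrt(fpc * variance / n)
    return total, 100 * se_total / total


# Hypothetical daily retrieval counts read from 4 of 8 available days.
total, rse = srs_total_and_rse([10, 12, 8, 10], population_size=8)
print(total)  # 80.0
print(rse)
```

Reading a larger share of the footage increases `n`, which shrinks both the finite population correction and the standard error, so the RSE falls as the sampling proportion grows.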

    Imputation of missing data from time-lapse cameras used in recreational fishing surveys

    While remote camera surveys have the potential to improve the accuracy of recreational fishing estimates, missing data are common and require robust analytical techniques to impute. Time-lapse cameras are being used in Western Australia to monitor recreational boating activities, but outages have occurred. Generalized linear mixed-effects models formulated in a fully conditional specification multiple imputation framework were used to reconstruct missing data, with climatic and some temporal classifications as covariates. Using a complete 12-month camera record of hourly counts of recreational powerboat retrievals, data were simulated based on ten observed camera outage patterns, with a missing proportion of between 0.06 and 0.61. Nine models were evaluated, including Poisson and negative binomial models, and their associated zero-inflated variants. The imputed values were cross-validated against actual observations using percent bias, mean absolute error, root mean square error, and skill score as performance measures. In 90% of the cases, 95% confidence intervals for the total imputed estimates from at least one of the models contained the total actual counts. With no systematic trends in performance among the models, zero-inflated Poisson and its bootstrapping variant models consistently ranked among the top 3 models and possessed the narrowest confidence intervals. The robustness and generality of the imputation framework were demonstrated using other camera datasets with distinct characteristics. The results provide reliable estimates of the number of boat retrievals for subsequent estimates of fishing effort and provide time series data on boat-based activity.

    Cardiometabolic syndrome among general adult population in Ghana: The role of lipid accumulation product, waist circumference-triglyceride index, and triglyceride-glucose index as surrogate indicators

    Background: Visceral obesity and insulin resistance contribute to developing cardiometabolic syndrome (MetS). We investigated the predictive abilities of lipid accumulation product (LAP), waist circumference-triglyceride index (WTI), and triglyceride-glucose (TyG) index for MetS screening among general Ghanaian adults. Methods: The final prospective analysis included 4740 healthy adults aged 30–90 years from three communities comprising Ejisu, Konongo, and Ashanti Akim Agogo in Ghana. A pretested self-structured questionnaire was used to collect sociodemographic, anthropometric, and clinical data. Blood samples were taken after fasting to measure glucose and lipid levels. LAP, WTI, and TyG were calculated from standard equations. MetS was defined by the International Diabetes Federation criteria. Receiver operating characteristic (ROC) curves and multivariable logistic regression were utilized to evaluate the potential of the three indices in identifying MetS. Results: Of the 4740 participants, 39.7% had MetS. MetS was more common in females (50.3%) than in males (22.2%). Overall, LAP ≥ 27.52 emerged as the best index for MetS with the highest area under the ROC curve (AUC) (0.866). At LAP cut-off points of ≥ 23.87 in males and ≥ 33.32 in females, AUCs of 0.951 and 0.790, respectively, were identified in MetS prediction. LAP was an independent risk measure of MetS for both males (45.6-fold) and females (3.7-fold) whereas TyG was an independent risk measure for females (3.7-fold) only. Conclusions: MetS is increasing among the general adult population. LAP and TyG are important sex-specific risk measures to screen for MetS among the general adult population in our cohort.
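The abstract above says the indices were calculated from standard equations without stating them. The sketch below shows the commonly cited published forms of two of them, LAP and TyG; treating these as the equations the authors used is an assumption, and the function names and example values are illustrative only. WTI is left out because its exact form is not given here.

```python
import math


def lipid_accumulation_product(waist_cm, triglycerides_mmol_l, sex):
    """LAP in its commonly cited form (assumed, not confirmed by the abstract):
    males:   (waist circumference [cm] - 65) x triglycerides [mmol/L]
    females: (waist circumference [cm] - 58) x triglycerides [mmol/L]
    """
    offsets = {"male": 65.0, "female": 58.0}
    if sex not in offsets:
        raise ValueError("sex must be 'male' or 'female'")
    return (waist_cm - offsets[sex]) * triglycerides_mmol_l


def triglyceride_glucose_index(triglycerides_mg_dl, fasting_glucose_mg_dl):
    """TyG in its commonly cited form (assumed):
    ln(triglycerides [mg/dL] x fasting glucose [mg/dL] / 2)
    """
    if triglycerides_mg_dl <= 0 or fasting_glucose_mg_dl <= 0:
        raise ValueError("concentrations must be positive")
    return math.log(triglycerides_mg_dl * fasting_glucose_mg_dl / 2)


# Hypothetical participant values, for illustration only.
print(lipid_accumulation_product(90, 1.5, "male"))    # 37.5
print(lipid_accumulation_product(90, 1.5, "female"))  # 48.0
print(triglyceride_glucose_index(150, 100))
```

Note the unit difference between the two functions: the LAP form takes triglycerides in mmol/L, while the TyG form takes triglycerides and glucose in mg/dL.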

    Unrecognized hypertension among a general adult Ghanaian population: An urban community-based cross-sectional study of prevalence and putative risk factors of lifestyle and obesity indices

    Hypertension (HTN) is the leading cause of cardiovascular diseases. Nevertheless, most individuals in developing countries are unaware of their blood pressure status. We determined the prevalence of unrecognized hypertension and its association with lifestyle factors and new obesity indices among the adult population. This community-based study was conducted among 1288 apparently healthy adults aged 18–80 years in the Ablekuma North Municipality, Ghana. Sociodemographic, lifestyle characteristics, blood pressure and anthropometric indices were obtained. The prevalence of unrecognized HTN was 18.4% (237 / 1288). The age groups 45–54 years [aOR = 2.29, 95% CI (1.33–3.95), p = 0.003] and 55–79 years [aOR = 3.25, 95% CI (1.61–6.54), p = 0.001], being divorced [aOR = 3.02, 95% CI (1.33–6.90), p = 0.008], weekly [aOR = 4.10, 95% CI (1.77–9.51), p = 0.001] and daily alcohol intake [aOR = 5.62, 95% CI (1.26–12.236), p = 0.028] and no exercise or at most once a week [aOR = 2.25, 95% CI (1.56–3.66), p = 0.001] were independently associated with HTN. Among males, the fourth quartile (Q4) of both body roundness index (BRI) and waist to height ratio (WHtR) [aOR = 5.19, 95% CI (1.05–25.50), p = 0.043] were independent determinants of unrecognized HTN. Among females, the third quartile (Q3) [aOR = 7.96, 95% CI (1.51–42.52), p = 0.015] and Q4 [aOR = 9.87, 95% CI (1.92–53.31), p = 0.007] of abdominal volume index (AVI), the Q3 of both BRI and WHtR [aOR = 6.07, 95% CI (1.05–34.94), p = 0.044] and Q4 of both BRI and WHtR [aOR = 9.76, 95% CI (1.74–54.96), p = 0.010] were independent risk factors of HTN. Overall, BRI (AUC = 0.724) and WHtR (AUC = 0.724) for males and AVI (AUC = 0.728), WHtR (AUC = 0.703) and BRI (AUC = 0.703) for females yielded a better discriminatory power for predicting unrecognized HTN. Unrecognized hypertension is common among apparently healthy adults. Increased awareness of its risk factors, screening, and promotion of lifestyle modification are needed to prevent the onset of hypertension.

    Prevalence of preeclampsia and algorithm of adverse foeto-maternal risk factors among pregnant women in the central region of Ghana: A multicentre prospective cross-sectional study

    Background: Preeclampsia is a leading cause of foeto-maternal deaths, especially in Sub-Saharan Africa. However, data on the prevalence and risk factors of preeclampsia are scarce in the Central region of Ghana, with previous work assessing individual independent risk factors. This study determined the prevalence and algorithm of adverse foeto-maternal risk factors of preeclampsia. Methods: This multi-centre prospective cross-sectional study was conducted from October 2021 to October 2022 at the Mercy Women’s Catholic Hospital and Fynba Health Centre in Central region, Ghana. A total of 1,259 pregnant women were randomly sampled and their sociodemographic, clinical history, obstetrics and labour outcomes were recorded. Logistic regression analysis using SPSS version 26 was performed to identify risk factors of preeclampsia. Results: Of the 1,259 pregnant women, 1,174 were finally included in the study. The prevalence of preeclampsia was 8.8% (103/1174). Preeclampsia was common among the 20–29 years age group and among those who had completed basic education, had an informal occupation, or were multigravida and multiparous. Being primigravida [aOR = 1.95, 95% CI (1.03–3.71), p = 0.042], having previous history of caesarean section [aOR = 4.48, 95% CI (2.89–6.93), p < 0.001], foetal growth restriction [aOR = 3.42, 95% CI (1.72–6.77), p < 0.001] and birth asphyxia [aOR = 27.14, 95% CI (1.80–409.83), p = 0.017] were the independent risk factors of preeclampsia. Pregnant women exhibiting a combination of primigravida, previous caesarean section and foetal growth restriction were at the highest risk for preeclampsia [aOR = 39.42, 95% CI (8.88–175.07), p < 0.001] compared to having either two or one of these factors. Conclusion: Preeclampsia is increasing among pregnant women in the Central region of Ghana. Primigravida women with foetal growth restriction and a previous history of caesarean section are the population at highest risk of developing preeclampsia, with neonates more likely to suffer adverse birth outcomes such as birth asphyxia. Targeted preventive measures for preeclampsia should be created for pregnant women with multiple co-existing risk factors.

    Association between micronutrients, oxidative stress biomarkers and angiogenic growth mediators in early and late-onset preeclamptic Ghanaian women

    Objectives: Micronutrients, especially calcium (Ca) and magnesium (Mg), are reported to reduce preeclampsia events via several factors such as endothelial cell control, optimal oxidative stress and a balanced angiogenic growth mediator. We evaluated the association of micronutrients with oxidative stress biomarkers and angiogenic growth mediators in early-onset preeclampsia and late-onset preeclampsia. Methods: This case-control study recruited 197 women with preeclampsia (early-onset preeclampsia = 70 and late-onset preeclampsia = 127) as cases and 301 normotensive pregnant women as controls from the Komfo Anokye Teaching Hospital, Ghana. Samples were collected after 20 weeks of gestation for both cases and controls and assayed for Ca, Mg, soluble fms-like tyrosine kinase-1, placental growth factor, vascular endothelial growth factor-A, soluble endoglin, 8-hydroxydeoxyguanosine, 8-epiprostaglandinF2-alpha and total antioxidant capacity. Results: Early-onset preeclampsia women had significantly lower levels of Ca, Mg, placental growth factor, vascular endothelial growth factor-A and total antioxidant capacity but higher levels of soluble fms-like tyrosine kinase-1, soluble endoglin, 8-epiprostaglandinF2-alpha, 8-hydroxydeoxyguanosine, soluble fms-like tyrosine kinase-1/placental growth factor ratio, 8-epiprostaglandinF2-alpha/placental growth factor ratio, 8-hydroxydeoxyguanosine/placental growth factor ratio and soluble endoglin/placental growth factor ratio than late-onset preeclampsia and normotensive pregnant women (p < 0.0001). Among the early-onset preeclampsia women, the first and second quartile for serum placental growth factor, first quartile for vascular endothelial growth factor-A and total antioxidant capacity and the fourth quartiles for serum sEng, serum sFlt-1, 8-epiPGF2 and 8-OHdG were independently associated with low Ca and Mg (p < 0.05). Among late-onset preeclampsia women, the fourth quartile for soluble fms-like tyrosine kinase-1 was independently associated with low Ca and Mg (p < 0.05). Conclusion: Magnesium and calcium are associated with an imbalance in angiogenic growth mediators and oxidative stress biomarkers among preeclampsia women, particularly early-onset preeclampsia. Serial and routine measurement of these micronutrients would allow the monitoring of poor placental angiogenesis while enabling an understanding of the triggers of increased oxidative stress and reduced antioxidant capacity in preeclampsia.

    Coagulation factors and natural anticoagulants as surrogate markers of preeclampsia and its subtypes: A case-control study in a Ghanaian population

    Preeclampsia (PE) is associated with endothelial injury and hemostatic abnormalities. However, the diagnostic role of coagulation parameters and natural anticoagulants in predicting PE has not been explored in Ghana. This study assessed plasma levels of these factors as surrogate markers of PE and its subtypes. This case-control study included 90 women with PE (cases) and 90 normotensive pregnant women (controls). Blood samples were drawn for the estimation of complete blood count and coagulation tests. The prothrombin time (PT), activated partial thromboplastin time (APTT), and the calculation of the international normalized ratio (INR) were determined by an ACL elite coagulometer while the levels of protein C (PC), protein S (PS), antithrombin III (ATIII), and D-dimers were also measured using the solid-phase sandwich enzyme-linked immunosorbent assay (ELISA) method. All statistical analyses were performed using the R Language for Statistical Computing. Results showed significantly (p < .05) shortened APTT (28.25 s) and higher D-dimer levels (1219.00 ng/mL) among PE women, as well as low levels of PC (1.02 g/mL), PS (6.58 g/mL), and ATIII (3.99 ng/mL). No significant difference was found in terms of PT and INR. From the receiver operating characteristic analysis, PC, PS, and ATIII could significantly predict PE and its subtypes at certain cutoffs with high accuracies (area under the curve [AUC] ≥ 0.70). Most women with PE are in a hypercoagulable state with lower natural anticoagulants. PC, PS, and ATIII are good predictive and diagnostic markers of PE and its subtypes (early-onset PE [EO-PE] and late-onset PE [LO-PE]) and should be explored in future studies.

    Share Prices Data.csv

    No full text
    This dataset contains the stock prices of companies on the Ghana Stock Exchange.

    Exchange rate forecasting via three currencies (GHS / USD / GBP)

    No full text
    This dataset contains exchange rate forecasting data for the Ghana cedi (GHS) against the British pound (GBP) and United States dollar (USD).